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ABSTRACT 

Context. Papers I and II of this series have extended the radio interferometry measurement equation (RIME) formalism to the full- sky 
case, and provided a RIME-based description of calibration and the problem of direction-dependent effects (DDEs). 
Aims. This paper aims to provide a practical demonstration of a RIME-based approach to calibration, via an example of extremely 
high-dynamic range calibration of WSRT observations of 3C 147 at 21 cm, with full treatment of DDEs. 

Methods. A version of the RIME incorporating differential gains has been implemented in MeqTrees, and applied to the 3C 147 data. 
This was used to perform regular selfcal, then solve for interferometer-based errors and for differential gains. 

Results. The resulting image of the field around 3C 147 is thermal noise-limited, has a very high dynamic range (1.6 million), and 
none of the off-axis artefacts that plague regular selfcal. The differential gain solutions show a high signal-to-noise ratio, and may be 
used to extract information on DDEs and errors in the sky model. 

Conclusions. The differential gain approach can eliminate DDE-related artefacts, and provide information for iterative improvements 
of sky models. Perhaps most importantly, sources as faint as 2 mJy have been shown to yield meaningful differential gain solutions, 
and thus can be used as potential calibration beacons in other DDE-related schemes. 
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Introduction 

The field around the bright radio source 3C 147 is a favourite 
showcase for dynamic range (DR) demonstrations. 3C 147 it- 
self is very bright (22 Jy at 21 cm), which ensures a high SNR 
for selfcal solutions, while the surrounding field boasts a spec- 
tacular collection of mostly point-like fainter sources. The ab- 
sence of significant extended emission has allowed very accu- 
rate sky models to be constructed. It is then not surprising that 
3C 147 was the f irst field to break the 10^ dynamic range barrier 
(Ide Bruvnll2QQ6l: Ide Bruvn et al.ll2Ql ll) . on a single 12-hour syn- 
thesis. This spectacular result was achieved with regular selfcal 
implemented in the NEWSTAR package. A major contributing 
factor is the relatively low leve l of beani-relat ed DDEs at the 
WSRT, as discussed in Paper II (ISmirnovll20Tll Sect. 2.lfl. An 
image of the 3C 147 field at 1,600,000: 1 dynamic range is shown 
in Fig.[T] The making of this image is the subject of Section[T] 

The very same benign properties that allow the WSRT to 
achieve record DR also make it a perfect instrument for study- 
ing DDEs. The latter are prominent enough to clearly show up 
in high-DR maps (see e.g. left inset of Fig. [T]), but not severe 
enough to hinder the building up of very deep s ky models dur- 
ing no rmal selfcaQ. The 3C 147 observations by Ide Bruyn et al.l 
(l201lh are a perfect example of this. I have therefore decided 



to reprocess these data using MeqTrees, to see if DDEs can be 
eliminated through the use of a suitable form of the RIME. 

In the presence of a dominant source (in this case 3C 147 
itself), selfcal solutions will tend to subsume all effects in the 
direction of that source. DDEs will then manifest themselves 
as artefacts around other sources, which need to be at a certain 
distance from the dominant source for the effect to become ap- 
parent. Since the dominant source is usually placed at or near 
the pointing centre (i.e. on-axis), DDE-related artefacts are also 
called off-axis artefacts. The artefacts themselves are quite clear 
(Fig.[Tl left inset): the nature of the effects responsible for them, 
far less so. At least four possibilities have been postulated: 



- Pointing errors (Paper IL [Smirnovl[2QTTl Sect. 2.1.4). 

- Differential tropospheric refraction (ibid., Sect. 2.2.4). 

- Errors in antenna positions, including non-coplanarity. 

- Other systematic coordinate errors. 



One of the objects of this study was to narrow down these 
possibilities. Section [2] therefore analyses the differential gain 
solutions obtained during this calibration, with a view to charac- 
terizing the DDEs. 



^ By contrast, even the post-upgrade Expanded VLA (EVLA), with 
its more significant DDEs, has not yet (at time of writing) exceeded 10^. 

^ Strictly speaking, this is only true in continuum mode. In spectral 
line mode, the 17 MHz "ripple" discussed in Paper II (iSmirnovii201k 



Sect. 2.1.1) becomes a very troublesome DDE. Further work is required 
on this subject. 
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Fig. 1. "Showcase" image of the field around 
the bright radio source 3C 147, produced af- 
ter reduction with MeqTrees. The image is 
noise-limited, and has a dynamic range of 1.6 
million. This DR was already achieved by de 
Bruyn using regular selfcal in NEWSTAR, but 
the resulting images contained artefacts around 
off'-axis sources (left inset) due to DDEs. A 
MeqTrees reduction incorporating diff'erential 
gains, as described in this paper, has completely 
eliminated the artefacts (rig ht inset). This 
image also appears in iNoordam & Smirnovl 

mm. 



1. Calibration approaches and results 

1.1. Observations and NEWSTAR reduction 

The observational data in question were obtained by de Bruyn in 
2003. A single 12-hour synthesis was taken, using 8 x 20 MHz 
bands (of 64 channels each) from 1300 to 1460 MHz, with 30 s 
integration time. Due to a back-end problem, one of the cross- 
correlations was corrupted, so only the XX and YY correlations 
were usable. De Bruyn then successfully reduced the observa- 
tion using the NEWSTAR package, achieving a world record 
1,600,000:1 dynamic range (Ide Bruvn et al.ll201 ll) . Only regular 
selfcal was done and no peeling was attempted, so the result- 
ing image showed DDE-related artefacts around off-axis sources 
(Fig. [T] left inset). One result of the reduction was a very deep 
NEWSTAR-format sky model for the field, containing about 300 
point sources. This provided a fantastic platform from which to 
begin my DDE study with MeqTrees. I had a ready-made sky 
model that was known to be good enough to reach the thermal 
noise with this particular dataset, and I had intermediate images 
from de Bruyn 's reduction that could be used as checkpoints. 

The same observation was repeated in 2006 wi th somewhat 
difiTerent correlator settings (de B ruyn et al.l l20Tll for details, 
see), and reduced in a similar manner. 

Eor his NEWSTAR reduction, de Bruyn self-calibrated each 
channel independently, rather than explicitly calibrating for a 
bandpass (see b elow). This procedure is described in Paper II 
(ISmirnovll201l[ Sect. 1.1). It uses the following implicit form of 
the RIME: 



= Gp{Up,^Xp,)G^, (1) 



s 

and consists of the following steps: 

1. Find Gp that minimizes \Xpq - Dpq\ in a least-squares sense. 
Compute "corrected data" as D^^ = G'^DpqG'^ . (The solu- 
tion interval here was one timeslot, one frequency channel. 
Only 56 channels in each band were usable; these were fur- 
ther averaged down using Hanning tapering, so in the end 
only 28 frequency points per band were used.) 

2. Find M^^ that minimizes |Mp^ ^ Xpq - D^^|. Compute "cor- 
rected data" as D^^ = D^^ ^ Mpq, where "-^" is element-by- 
element division - the inverse of ">!=". (The solution interval 
here was the full 12 hours, one band.) 

3. Compute "residual data" as R^^ = D^^ - Xpq. 

This procedure was repeated for each band. Residual visibil- 
ities were imaged and summed across all 8 bands, then decon- 
volved using Hogbom CLEAN. The sky model was then added 
back in using a Gaussian restoring beam. 

1.2. Calibration in MeqTrees: an overview 

In broad terms, selfcal in MeqTrees also consists of a least- 
squares fit of an equation such as ([B to the data. However, the 
following features are diflTerent: 

- The structure of the equation is not fixed: arbitrary forms of 
the RIME may be constructed. Crucially for my purposes, 
these may include DDE terms. 
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- The elements of the Jones matrices are not necessarily sim- 
ple solvable parameters (though they may be), but can be 
represented by arbitrary functions. For example, rather than 
solve for Jones (or Z- Jones) elements directly, MeqTrees 
can derive them from some model of the primary beam (or 
ionosphere) and solve for the parameters of the model. An 
example of this is given in Sect. 11.61 

- Different solvables may have different solution intervals, 
even in a simultaneous solution. 

Because of the inherent flexibility of MeqTrees, calibration 
can be pursued in a great variety of ways. During this study, 
a specific methodology was narrowed down, implemented and 
tested. This became the basis of the "Calico" calibration frame- 
work that is now included with MeqTrees. The steps (and termi- 
nology) of calibration with Calico are as follows: 

1. A desired form of the RIME is constructed, by selecting a 
sky model, and picking a series of Jones terms (plus, op- 
tionally, interferometer-based errors). For example, a form 
similar to NEWSTAR's impHcit RIME (Eq.[T]), would be: 



\lpq is usually called the "corrupted predict". 

2. "Corrupted predict" is fitted to "data". That is, the MeqTrees 
solver is instructed to find values of solvable parameters that 
minimize ID^^ - V^^l in a least-squares sense. This can be 
(and usually is) done in multiple stages, e.g. Gp first, fol- 
lowed by Mp^, etc. 

3. Output visibilities are computed as either "corrected data" 



pq 



or as "corrupted residuals" Rpq 
residuals" 



Dpq - \Jpq, or as "corrected 



= (^pq ^ ^pq)G\ 



H-l 



Subsequent imaging steps are not considered part of Calico 
or MeqTrees per se, since they apply equally to data calibrated 
using any other means, and are accomplished via external tools 
such as the Iwimager program (part of the c as are st package). 
These steps may also differ from project to project. I routinely 
image the per-band corrected residuals R^^, since these provide 
the best visual indicator of the quality of a calibration. For this 
particular study, I subsequently made 8-band residual images 
in MFS mode, using all data. The images were further decon- 
volved using Cotton-Schwab CLEAN (iSchwabt 1984). and the 
sky model was added back in using a Gaussian restoring beam. 

While Calico can produce "corrected data" (for purposes of 
imaging, etc.) at any step of the reduction, it does not use it as in- 
put for subsequent stages like NEWSTAR and other 2GC pack- 
ages do. The fitting at step 2 is always done using the original 
observed data Dpq, and calibration consists of building up the 
RIME until the "corrupted predict" fits the observations. Once 
DDEs enter the picture, "corrected data" (in the conventional 
understanding of data corrected for instrumental errors) do not 
really exist any more, since visibilities can only be corrected for 
the value of a DDE in a particular direction. Hence the Calico 
philosophy is to work with the original data at all times. 



1.3. Bandpass selfcal 

A per-channel, per-timeslot solution for Gp in Eq. ([T]) has 2N 
complex unknowns, and N(N-1) complex measurements, where 
N is the number of stations (14 for the WSRT). While an ac- 
ceptable ratio (this is, after all, why selfcal works in the first 
place!), it does not leave a lot of room for introducing DDE- 
related parameters. In general, we want to allow our solutions as 
few DoF's as possible, and making the solution intervals (in time 
and/or frequency) larger is one way of ensuring this. 

The WSRT has a reasonably stable bandpass, so an obvi- 
ous way to reduce the parameter count is to separate Gp into 
a bandpass component to capture the frequency structure (with 
little to none variation in time), and a frequency-independent, 
rapidly varying complex gain. Per each station/receiver, this re- 
places A^chan X Mime parameters with only A^chan + Mime of them. 

For the MeqTrees reduction, I therefore started with the fol- 
lowing RIME: 



Here, X 



GpBp 



(2) 



spq^ 



Gp and M^^ have the same meaning as in 



Or at most pre-averaged. 



the NEWSTAR equation above (including a similar smearing 
correction term in X^^^). The Bp term is a second diagonal 
Jones matrix representing the bandpass. Note that MeqTrees it- 
self makes no special distinction between G and B. Both are 
generic diagonal complex Jones matrices, with solvable real and 
imaginary parts. It is only when we specify the solution intervals 
that these Jones terms acquire their intended meanings: 

- The Gp solution interval is 30 s, all channels (thus, one inde- 
pendent solution every timeslot, per the entire band). 

- The Bp solution interval was initially set to 30 minutes, and 
one channel (thus, one independent solution every channel, 
per 60 timeslots). Note that as the bandpass has significant 
structure, I did not attempt to fit it with any smooth function, 
but rather allowed each channel to be fitted as an independent 
parameter, with a timescale of 30 minutes. The latter interval 
was intended to accommodate slow drift in the bandpass. 

The initial result of this calibration was profoundly unsatis- 
factory (Fig. [21 upper left). The residual image was dominated 
by spoke-like artefacts centred on 3C 147, at about 10,000:1 
level (relative the flux of 3C 147 itself). These spokes correspond 
to edges of the 30-minute solution intervals (being an E-W ar- 
ray, WSRT has a one-dimensional instantaneous PSF). The ob- 
vious explanation for the error is short-term bandpass instability. 
Decreasing the solution interval of Bp to 15 and 7.5 minutes re- 
duced the artefacts to levels of 50,000:1 to 100,000:1, but did 
not eliminate them entirely. Smoothing the 7.5 minute solution 
with a spline (along the time axis, per each channel indepen- 
dently) produced a marginal improvement (Fig.[2l lower right). 
Subtraction artefacts are still plainly visible in the map, although 
at a level not significantly above thermal noise. 

At this stage I had to conclude that the WSRT bandpass ex- 
hibits some very low-level, but extremely short-term jitter, pre- 
cluding a separate bandpass selfcal at extreme dynamic ranges. 
On the other hand, this result also shows that where a single- 
band dynamic range of no more than 100,000 is expected (as is 
the case for many other observations), bandpass selfcal provides 
perfectly adequate results, and can cut down on the number of 
solvable parameters significantly. In the meantime, for the 3C 
147 study I had to revert to the per-channel selfcal approach of 
de Bruyn. 



4 



O.M. Smirnov: Revisiting the RIME. III. Addressing DDEs in WSRT observations of 3C 147. 




Fig. 3. Single-band residual images produced via per-channel 
selfcal. The left image is the result of solving for Gp. It is 
dominated by concentric rings centred on 3C 147 (designated 
as "A 150" here). These are caused by closure errors, and go 
away once a solution for interferometer-based errors M^^ is done 
(right image). The remaining artefacts are associated with off- 
axis sources B, C and D, and are due to DDEs. 



Fig. 2. Single-band residual images produced via bandpass self- 
cal with different solution intervals for Bp\ 30 minutes (upper 
left), 15 minutes (upper right), 7.5 minutes (lower left), and with 
the 7.5 minute solution smoothed using splines (lower right). 
Even in the best-case image, the dominant source 3C 147 was 
not subtracted out perfectly, leaving behind DR-limiting arte- 
facts. 



1.4. Per-channel selfcal 

The RIME for per-channel selfcal is just Eq. ^ without the 
bandpass term. It is, in fact, very similar to NEWSTAR's im- 
plicit equation ([TjQ 



20' away from centre. The artefacts themselves are under 50yuJy 
(thermal noise in one band being ~ 30yuJy), or at a level of 
1:1000 relative to the associated sources. This is why lde Bruynl 
( 2006) talks about an "off-axis dynamic range" of a 1000: while 
3C 147 itself (22 Jy) is subtracted without a trace (down to 
the thermal noise), the off-axis sources are only subtracted to 
a precision of about 1000. Some of the artefact structure is 
doubtlessly due to slightly under- or overestimated sky model 
fluxes (this produces regular rings matching the WSRT PSF), 
which can be taken care of during subsequent deconvolution. 
Most of it, however, is due to DDEs and does not deconvolve, 
producing artefacts in the final 8 -band images such as the one 
shown in the left inset of Fig. [T] (the inset is, in fact, a close-up 
of source B). 



^pq = >^ 



(3) 1.5. Interferometer-based errors 



Per-channel selfcal is achieved by setting the solution inter- 
val of Gp to one channel and one timeslot. The resulting single- 
band residual images are dominated by closure errors at a level 
of ~ lOOyuJy (or 1:200,000 relative to 3C 147 itself); these go 
away after an M^^ solution (Fig. [3]). These images are quali- 
tatively very similar to those obtained by de Bruyn during his 
NEWSTAR calibration (which is to be expected, given the sim- 
ilarity of the equations). 

The remaining artefacts in Fig. [3] are associated with the 
three next-brightest source^ in the field: B (42 mJy), C 
(52 mJy), and D (22 mJy). The furthest of these (B) is about 



^ With the exception of the position of the M^^ term, which is on the 
inside of Eq. ([T]), and on the outside here. For this particular dataset it 
makes no difference: since only the XX and YY correlations are used, 
all matrices in Eq. ([T]) are diagonal, and for diagonal matrices the 
operator is equivalent to matrix multiplication, and commutes. For the 
full-polarization case, the two approaches are not equivalent. 

^ The source IDs used here follow the "COPART" (Clustering, 
Order, Position Angle, Radius, Type) convention, as implemented by 
the Tigger sky model management tool (available with MeqTrees). A 
COPART source ID starts with an alphabetic designator (A, B, ... Z, 
aa, ab, ...) assigned to sources in order of decreasing brightness. This is 
already a unique identifier, and is sometimes used by itself for brevity, 
as in the paragraph above. A full ID also encodes approximate posi- 
tion relative to field centre: two digits for the position angle (in units 



It is not clear what causes closure errors at the WSRT. Common 
sense suggests the analogue part of the signal chain is to blame, 
but there is no hard evidence either way. What is evident is that 
high-DR images exhibit concentric ring-like artefacts such as 
those in the left image of Fig. [3l and that these go away once 
a solution for an interferometer-based multiplicative error - the 
Mpq term of Eq. ([3]) - is applied. A single solution per band, per 
the entire 12 hours (per correlation and interferometer) is suffi- 
cient. The Mpq solutions are usually within 10"^ - 10"^ of unity 
(as is the case here), but can be much higher in some observa- 
tions, for reasons that remain mysterious. 

The latter fact suggests an intrinsic time variability, but solv- 
ing for Mpq on short time intervals is very dangerous. Any so- 
lution for Mpq will also try to compensate for observed flux that 
is not present in the sky model. Unless the solution interval is 
sufficiently long, there will be unmodelled sources with a fringe 
rate slow enough that their vector average visibility over the so- 
lution interval will be significantly non-zero. These sources will 
then tend to be attenuated by the M^^ solutions. The 3C 147 ob- 
servations provide a perfect example (Fig. [4]). On the left is an 
8 -band residual image with a 12 hour M^^ solution; on the right 



of 10°), and one digit for radial distance (in units of 10'). Optional suf- 
fixes indicate source type and clustering. Thus the full ID of source B is 
B232; being slightly extended, in this particular sky model it is actually 
represented by a cluster of six delta functions: B232, B232a, ... B232e. 
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Fig. 4. Source suppression through interferometer-based error 
solutions. On the left is a deconvolved 8 -band residual image 
of the centre of the field, with 12 hour solutions for M^^. On 
the right is the same image with 30 minute solutions. The po- 
sitions of (subtracted) model sources are indicated by crosses. 
Suppression of unmodelled sources is evident in the right im- 
age. 



is one with a 30 minute solution. Model sources are indicated by 
crosses. Attenuation of unmodelled sources towards phase cen- 
tre is clearly visible in the right image. 

This implies that closure errors cannot be reliably solved for 
on observations shorter than 12 hours, unless a "complete" (i.e. 
down to the noise) sky model of the center of the field is avail- 
able. 



1.6. Pointing self cal 

Since the main cause of artefacts in WSRT images is com monly 
considered to be pointing error (see Paper II, Smirnovt 120111 
Sect. 2.1.4), I decided to i mplement a form of the pointing self- 
cal algorithm suggested by'Bhatnagar e t al.l (12004 *). This proved 
to be a straightforward exercise in MeqTrees, since only a small 
modification of the RIME of Eq. ([3]) was required: 



^pq = ^pq^\ Gp 



^ Esp^spq^sq 



7H 



(4) 



Instead of a per- source beam gain Es = E(ls, rris ) (where 
E(l, m ) is the cos^ approximation given by Paper II, ISmirnovl 
I2011L Sect. 2.1.1), which I had been using in the previous equa- 
tions, here I used a per- antenna, per-source beam gain Esp, de- 
fined as follows: 



Esp = E(ls + Sip, Ms + Snip) 



(5) 

were then treated as 



Per- antenna pointing off'sets dip. Snip 
solvable parameters. 

The results of this proved inconclusive. Even though the so- 
lution converged to some physically-sensible off'sets (on the or- 
der of arcmin), no tangible reduction of artefacts was observed 
in residual images. This could be due to a number of reasons: 

1 . The cos^ approximation is not good enough - unlikely, as it 
has been independently verified at least for the core of the 
main lobe, which the sources in question sources are well 
within. 

2. With only a few sources sufficiently bright to exhibit DDEs, 
we simply don't have enough constraints for a pointing so- 
lution on this field. 

3. The model fluxes/positions for the sources in question are 
not sufl&ciently accurate. 



Fig. 5. Results if the flyswatter. On the left is a single-band resid- 
ual image after Gp and M^^ solutions only. On the right is the 
same image with differential gain solutions for sources B, C, and 
D. 



4. The dominant DDE affecting this observation is not pointing 
error. This will be elaborated on further in Sect. [2j 

5. There is something wrong with my implementation of point- 
ing selfcal, especially since the figures in Sect. [2l suggest that 
mispointings are detectable. 

Trying to get a better grip on the problem, I eventually set- 
tled on a "controlled experiment": locating a field containing 
multiple bright off-axis sources, and observing it with delib- 
erately exaggerated mispointings, to see if these can be more 
readily recovered from the data. This experiment became known 
as the "QMC Project" (in honour of the long-defunct Quality 
Monitoring Committee of WSRT), and was successfully carried 
out. The results of this are still being processed, and will be pre- 
sented in a follow-up paper. In the meantime, I had to look for 
alternative approaches to DDEs in the 3C 147 field. 

1.7. Differential gains: the "flyswatter" 

In the spirit of "ph enomenological" equations discussed in Paper 
II (ISmirnovl201ll Sect. 1.3), I decided to introduce a differential 
gain Jones (AE"- Jones) into my form of the RIME: 



ypq = * 



Gn > AEsnEsXs 



E^AE 



H 



(6) 



The AEsp term is meant to subsume all DDEs associated 
with source s and antenna p (with the exception of the nominal 
primary beam gain, which is already represented by Eg). Solving 
for this term requires some caution, lest too many DoF's be in- 
troduced into the equation. I approached this as follows: 

- A^ was fixed at unity for all sources except B, C, and D; 

- For B, C and D, A^ was set to a diagonal matrix with solv- 
able complex elements. 

- The solution intervals for the A^ elements were set to one 
solution per 30 minutes, per entire band (and per source, an- 
tenna, receptor). 

Figure [5] shows the effect of A^ solutions. All artefacts as- 
sociated with sources B, C and D disappeared completely, to the 
point where artefacts around fainter sources became just about 
visible. These were later eliminated once A^ solutions were 
enabled for those sources. In fact, any source to which a solv- 
able A^ term was assigned promptly vanished from the residual 
maps, hence my name for the differential gains approach: the 
"flyswatter" algorithm. 
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ID 



position total flux 



B232 
C270 
D141 
F012 
H283 
K063 
ae317 



20' SW 
4' W 
14' SE 
25' N 
37' W 
31' NE 
74' NW 



42.3 mJy 
51.8 mJy 
21.8 mJy 
12.3 mJy 
9.4 mJy 
8.2 mJy 
2.2 mJy 



Fig. 6. Positions (relative to nominal pointing centre) and aggre- 
gate fluxes (apparent) of the seven oflT-axis source clusters for 
which solutions were obtained. Circles are at a radius of 30' 
and 1°. For reference, the FWHM of the WSRT voltage beam is 
~ 50' at 1.4 GHz. 



Once 8 -band residual images were created, the increased 
sensitivity made it apparent that four more sources were exhibit- 
ing a small amount of DDEs. A more in-depth look at the sky 
model also showed that all seven sources were in fact slightly 
extended; the NEWSTAR model represented each by a tight 
cluster of point sources. Since all "sources" in such a cluster 
are subject to the same DDE, it seemed sensible to make sure 
the same A^^^ term was applied to all of them. This was easily 
done using Tigger, a sky model manager/viewer tool included 
with MeqTrees. Tigger automatically detects source clusters and 
assigns source identifiers appropriately (e.g. B232, B232a, ... 
B232e) It was then a simple matter to tell the Calico framework 
to use the same AEs^p term for all sources s associated with clus- 
ter ^0- 

The seven source clusters for which diflTerential gain solu- 
tions were eventually obtained are summarized in Fig. (6] Two 
of them are somewhat noteworthy. Source C270 is very close to 
centre, and therefore shouldn't be aflTected by DDEs as much as 
the other sources. It is, however, a complicated and highly po- 
larized source, so perhaps the artefacts it exhibits after regular 
selfcal are primarily due to sky model inaccuracies, which the 
A^ solutions absorb (see discussion in Sect. 12. II) . Source ae317 
is almost the opposite: it is very faint, but far enough oflT-axis to 
be in a sidelobe of the primary beam, and so subject to especially 
severe DDEs. 



1.8. The showcase result 

The ultimate result of my calibration of the 2003 observation is 
shown in Fig.[T] This image is a true showcase for the diflferential 
gains approach. The precise steps leading to this image were as 
follows: 

1 . Each of the 8 bands was independently calibrated using per- 
channel selfcal, interferometer-based errors, and A^ solu- 
tions on seven source clusters, as described above. Corrected 
residuals were generated. 

2. The residuals for all 8 bands were imaged together (in MFS 
mode) to produce a single residual image. This revealed a 
large number of fainter sources not visible in the per-band 
maps. 

3. The 8-band image was deconvolved using Cotton-Schwab 
CLEAN (Schwab 1984). 

4. The sky model was added back into the deconvolved image, 
using a Gaussian restoring beam. 



The resulting image is completely artefact- free. Presumably, 
all other sources in the field are too faint to exhibit any DDE- 
related artefacts. With a dynamic range of 1,600,000:1, this im- 
age is the deepest and cleanest single- synthesis radio map in the 
world to date. 



1.9. Flyswatter limitations 

While it can help produce spectacular images, the flyswatter has 
some serious caveats and drawbacks that need to be explored. 
First of all, it is a brute-force approach, in the sense that it 
squashes all eflTects into a single A^ term. This includes inac- 
curacies in the sky model! Indeed, any missing source flux or 
error in source position can be accommodated with a suitable 
A^. Even unmodelled source structure can "leak" into diflferen- 
tial gain solutions (Sect. 12.11) . Thus, diflTerential gains are good 
for subtracting sources, but at the cost of mashing up informa- 
tion on the source per se. (One does not use a flyswatter to probe 
a fly's anatomy!) 

Secondly, solvable diff'erential gains can lead to a prolif- 
eration of DoF's. Per-channel selfcal has A^ant unknowns per 



A^ant(A^ant-l) 



measurements, or - 



Y unknowns per measurement; 
unknowns per measurement. 



difl'erential gains add ... ,.^r 
where A^time and A^freq are the sizes of the solution interval for 
A^. This ratio remains favourable for small A^src and large A^time 
and/or A^^eq (as is the case for my 3C 147 reduction), but one 
must be careful. 

The third caveat is processing cost. While usually not as ex- 
pensive in terms of I/O or CPU as peeling (which, in addition 
to the solutions themselves, requires repeated subtraction and 
phase shifting steps), the flyswatter is not free. Every source with 
a difl'erential gain solution adds 4A^ant unknowns (assuming a di- 
agonal complex AEsp term, hence 4 real values per matrix) to 
the equations. As the number of unknowns (A^unk = 4A^antA^src) 
grows, inversion of the normal matrix within the least- squares 
solver becomes a CPU bottleneck, since it scales as ^(A^^^k)- 
This makes it impractical to solve for A^'s for more than a hand- 
fuQof sources at a time. 

One way to mitigate the solver bottleneck is to decompose 
the AEsp's into nearly-orthogonal sets of unknowns. For exam- 
ple, we can treat the set of AEsp's associated with one source 
.s- as independent from all other sources. The (4A^antA^src)^ nor- 
mal matrix inside the solver then becomes block-diagonal, com- 
posed of A^src blocks of size (4A^ant)^- Inversion of this matrix then 
scales as 0(Nsrc)0(Nl^^). This scheme was tested in MeqTrees, 
and it was found that the trade-oflf is slower convergence, requir- 
ing more iterations. For large numbers of sources, however, this 
becomes very favourable. 



2. Analysis of differential gain solutions 

It is time to see whether any useful information can be gleaned 
from the diflferential gains solutions themselves. As a result of 
the reduction, I had obtained: per each source direction (7 of 
these: see Fig.O, per each antenna (14), per each band (8), per 
30-minute interval (24 of these in a 12-hour synthesis), two com- 
plex numbers representing the apparent diff'erential gain of the X 
and Y dipole ("diflferential" being relative to the gain in the di- 



^ The precise meaning of a "handful" here depends on additional fac- 
tors such as A^ant, size of solution intervals, etc. In effect, these factors 
influence the constant of the overall cubic scaling law. 
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rection of 3C 147 - almost at the centre of the field - which had 
been taken care of by regular selfcal). 

I then adopted the following approach. Given the relatively 
low fractional bandwidth, I didn't expect much variation with 
frequency in A^. I therefore treated each set of 8 per-band so- 
lutions as independent samples of the same variable. The mean 
of the 8 samples was used as an estimator of variable, and the 
standard deviation of the 8 samples as an estimator of the error 
(i.e. the error bar). 

Since it quickly became apparent that the A^ solutions were 
exhibiting some very interesting behaviour, I applied exactly the 
same procedure to the 2006 observations, so that comparisons 
could be made. The plots below show the results from both ob- 
servations. 

Figure |7]is a summary of the diff'erential gain-amplitudes per 
source, per antenna. The precise quantity plotted here was com- 
puted as follows. First, I computed the norm of the A^ matrix 
(diagonal by construction) as 



a 
b 



I then normalized (divided) this value by the mean value per 
source (that is, the mean across all time intervals, bands, and 
antennas), which was meant to take out the eff'ect of incorrect 
model fluxes (see below). The resulting "normalized norm" was 
then plotted as a function of time, per source, per antenna. 

Figure[8]is a similar plot of the difl'erential gain-phases, com- 
puted as 



a 0\ SiTga-\-SiYgl 
Mob^ 2 



The most striking feature of Figs. [7] and [8] is the high SNR. 
They show a high degree of temporal continuity in the solutions, 
and statistically significant structure. This strongly suggests that 
the solutions represent real physical or numerical eff'ects. As to 
the nature of these eff'ects, I still do not have satisfactory an- 
swers, though it is hoped that the "QMC Project" mentioned ear- 
lier will shed some more light on the issues. The rest of this sec- 
tion discusses some of the more prominent questions, and pro- 
poses some rather speculative explanations. 

2.1. Absorbing errors in the sky mode! 

As already mentioned, a major caveat of the fly swatter approach 
is that A^ solutions will tend to absorb inaccuracies in the source 
model. By analogy (and for exactly the same reasons), clas- 
sic selfcal alone cannot solve for absolute positions or fluxes. 
Indeed, if the true position of a source / = (/, m) is ofl'set from 
the model position f^^"^^ = l-{-Sl = {l + dUm + dm), while the true 
brightness B difl'ers from the model brightness by a multiplica- 
tive matrix factor: B^"^°^^ = ABA^ (the latter being a straight- 
forward generalization of a scalar factor a^), then the coherency 
term of the RIME for the model source may be written out as 



\/(mod) 

pq 



= Kp{dl)Kp{l)AEA''K^{l)K^m 

= [Kp(Sl)A] . [KpiDSKfiD] . [K,(Sl)Af 

= [Kp(Sl)A]-Xp,-[K,(Sl)Af. 



If a solvable difl'erential gain is then assigned to the source, 
the model can be made to fit the data by absorbing the Kp(Sl)A 
factor into the A^^^ solutions. 

Even more insidiously, diff'erential gains can absorb some 
source structure. Consider a source that is slightly extended in 
one direction, enough to be resolved on the longest baselines. An 
E-W array like the WSRT has a one-dimensional instantaneous 
fan beam. It will "see" the source as a point source when the fan 
beam is aligned with the source orientation, and start resolving 
it when the fan beam becomes perpendicular to the source. In 
other words, the apparent flux of the source will remain constant 
in time on short baselines, and vary in time on the long base- 
lines as the source resolves. If such a source is represented by a 
point source in the sky model, the model flux will be constant 
on all baselines. Now, if some antennas are predominantly in- 
volved in long baselines (RTC and RTD, in the case of WSRT), 
A^ solutions can compensate for some of the flux discrepancy 
by changing the gain-amplitudes of these antennas. I would ex- 
pect to see a variation of ||A^|| with a 12-hour period. Since most 
of the baselines to RTC and RTD are mutually redundant (0-C 
equals 1-D, etc.), their variation in ||A^|| should be very similar. 

This is exactly what we're seeing in Fig. [7]! The plots very 
strongly suggest that the top three sources (B, C and D) are in- 
deed slightly extended (more so than in the model, that is). If this 
is the case, then the dominant contribution to ||A^|| on antennas 
RTC and RTD is due to source structure rather than any actual 
DDE. 



2.2. Amplitude behaviour 

If the behaviour of difl'erential amplitude on RTC and RTD is due 
to source structure, this still leaves efl'ects on the other antennas 
unexplained. First let us consider what a pointing error would 
look like. With the exception of source ae317 (for which the so- 
lutions are much too noisy anyway), the other six sources are 
well within the main lobe of the primary beam, where the beam 
gain can be expected to decrease smoothly with distance from 
pointing centre. We should therefore expect to see HA^^^H > 1 
if antenna p mispoints towards source s (the source appears 
brighter on antenna /?), and HA^^^H < 1 if it mispoints away. 
WSRT dishes are equatorially mounted, so mispointings due to 
mechanical or electronic errors in either axis drive would be sta- 
tionary with respect to the sky, and thus cause constant ||A^|| 
ofl'sets. Mispointings due to wind pressure, thermal or gravi- 
tational deformation, on the other hand, would be intrinsically 
time- variable (but would perhaps correlate between adjacent an- 
tennas). 

Quite a few plots in Fig.|7]do show (mostly) static ofl'sets. It 
can be illuminating to present ||A^|| in a format I call the "rogues 
gallery" (Figs. l9l and [TOl). This shows, for each of the 14 anten- 
nas, a 12-hour average ||A^|| per source, using circles of vary- 
ing size placed at the position of the source. The magnitude of 
(IIA^II - 1) is indicated by circle size, and the sign by colour. A 
static mispointing in, e.g., a Northern direction would show up 
as blue circles in the top half of the plot (i.e. sources appearing 
brighter), and red circles in the bottom half. 

The galleries show exactly such a pattern for antennas RT5 
through RT8 (and perhaps RT4), for both the 2003 and, to a 
lesser extent, the 2006 observations. It is a little bit strange that 
4 (or even 5) adjacent antennas would so consistently mispoint 
North, and do the same three years later. Perhaps this is another, 
poorly understood consequence of unmodelled source structure. 
Some pointers to this are that antennas RT4-8, being in the mid- 
dle of the array, form up predominantly shorter baselines, and 
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Fig. 7. Differential gain-amplitudes (||A£'||) as 
a function of time for the 2003 (top) and 
2006 (bottom) observations. Rows correspond 
to sources, columns to antennas. The vertical 
plot scale is fixed within each row, but diff'ers 
from row to row. Horizontal lines indicate the 
IIA^II = 1 level. 
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Fig. 8. Diff'erential gain-phases (argA£^, in de- 
grees) as a function of time for the 2003 (top) 
and 2006 (bottom) observations. Rows corre- 
spond to sources, columns to antennas. The ver- 
tical plot scale is fixed within each row, but dif- 
fers from row to row. Horizontal lines indicate 
the arg AE = level. 



that the long-baseline antennas RTC and RTD exhibit the op- 
posite behaviour. (What hinders such an analysis is the unfortu- 
nate fact that the three brightest oflT-axis sources all exhibit some 
structure, and all three lie in the bottom half of the field.) Another 
puzzling feature is the consistently low ||A^|| for sources F, H 
and K on antenna RTC in 2003 (and to a far lesser extent in 
2006). If due to source structure, why does it not repeat on RTD? 
Perhaps RTC is mispointing to the South? 

Antennas RTO-2, RT9 and RTA, on the other hand, show 
completely diflferent patterns, with little to no similarity between 
2003 and 2006. Some of these are consistent with a static mis- 
pointing. Some antennas (RT8 and RT9, and RTB especially) 
also show a hint of time variability in ||A^||. 

In any case, it is clear that the complicated interaction be- 
tween source structure and diflferential gain-amplitudes makes 
the latter extremely difl&cult to interpret. Note also that my 
(or rather de Bruyn's) source model was built by NEWSTAR 
based on regular selfcal, so there's bound to be some contamina- 



tion from DDE-related artefacts in the source parameters. Truly 
robust methods for disentangling source structure from DDEs 
have yet to be developed. It is also clear that an approach that 
parametrizes the DDEs in a "global" way, such as pointing self- 
cal, is the way forward - what is not yet clear is how much the 
global solution itself can be affected by unmodelled structure in 
the brighter sources, and what to do about it. 

2.3. Phase behaviour 

A rather prominent feature of the phase plots in Fig. [8] is their 
continuity from antenna to antenna, and the fact that the phases 
at the two ends of the array exhibit opposite temporal trends. 
This is suggestive of an evolving phase slope over the array. 
Fitting such a slope (per source) produces some very striking re- 
sults (Fig.[TT]). The dominant phase eflfect is clearly global rather 
than antenna-based, and is extremely consistent across both ob- 
servations. 
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Fig. 9. "Rogues gallery" plot for the 2003 observation. This 
shows the 12-hour average ||A^|| per source, as seen by each an- 
tenna. Blue circles correspond to values of ||A^|| > 1, red circles 
to values of ||A^|| < 1, and areas are proportional to | ||A^|| - 1 1. 
Line thickness indicates the statistical significance of | ||A^||- 1 1; 
filled circles are for detections of over 3cr. The large grid circle 
is at radius 30'. 

Average ||dE||, RTO Average ||dE||, RTl Average ||dE||, RT2 Average ||dE||, RT3 Average ||dE||, RT4 



Average ||dE||, RT5 



Average ||dE||, RT6 



Average ||dE||, RT8 



rage ||dE||, RTA Average ||dE||, RTB Average ||dE||, RTC Average ||dE||, RTD 

O 



Fig. 10. "Rogues gallery" plot for the 2006 observation, usin^ 
the same scale as Fig. [9] 
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Fig. 11. Phase slopes over the array as a function of time (in 
deg/km) in the direction of the seven sources for the 2003 (top) 
and 2006 observations (bottom). The green lines indicate phase 
slopes corresponding to the fitted position off'sets (Fig. [121), the 
red lines - phase slopes corresponding to an overall field rotation 
of 45''. 
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Fig. 12. Fitted position off'sets corresponding to the phase slopes 
of Fig. [TT] (2003 observation on the left, 2006 on the right). 
The length of the arrows is exaggerated by a factor of 1200: the 
biggest oflfset is in fact just under 1". 



observation), is, to first order, consistent with a rotation of 45" 
around the phase centre. The corresponding slope curves are 
plotted in red. While there are some significant diff'erences in 
the brighter sources, it seems clear that the dominant eff'ect is 
not an instrumental DDE at all, but a systematic rotation of the 
sky model. The model positions are derived by NEWSTAR from 
direct fits to the visibilities, and de Bruyn (priv. comm.) has inde- 
pendently cross-checked the positions of distant sources against 
the NVSS, which seems to preclude a rotation in the model itself. 
Note that a 45" rotation can also be introduced by a clock error 
of about 2.9 s, or a corresponding rotational error in conversion 
of uvw coordinates from apparent to J2000. Since NEWSTAR 
and MeqTrees use completely diff'erent tool chains and visibil- 
ity data formats, I cannot exclude a coordinate conversion error 
somewhere along the line. This needs to be urgently investigated. 
If indeed the entire sky model is slightly rotated, then perhaps the 
image of Fig. [T] can even be improved upon! 

2.4. Feeding differential gains back into the sky model 

The results above suggested that I could improve my sky model 
by feeding back in some information extracted from the A^ so- 
lutions. In the previous section, I obtained a correction to the 
model positions of the seven sources Following the discussion 
of Sect. 12.11 I could also provide corrections for the / and Q 
fluxes by applying the per- source average A^ amplitudes: 



Bf^''^ = |A^,|B,|A^, 



\ae;\ . ' 1^^- 



\Ae. 



where ti and vj represent the time and frequency solution 
intervals of A^. In terms of the / and Q fluxes, the correction 
becomes: 



A phase slope over the array can be interpreted as an appar- 
ent position oflfset. It appears that the slope behaviour in Fig.fTTI 
can be fitted quite well by constant position off'sets. The best- 
fitting position ofl'sets are indicated in Fig. [121 and the corre- 
sponding slope curves are plotted in green on Fig.[m 

Figure [121 immediately suggests a field rotation. And indeed, 
the entire collection of phase slopes (for both the 2003 and 2006 



/(corr) = 2 . / + A . 2, Q^^^^) = A • / -h E . e. 



^ For the moment, I've left aside the issue of whether the positional 
offsets are ultimately due to a global field rotation. Improving the posi- 
tions of seven of the brightest off-axis sources should already produce 
an superior sky model. 
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Fig. 13. Calibration with an improved sky model. This shows 
single-band residual images after Gp and M^^ solutions. The left 
image is from the original reduction, the right image uses a sky 
model improved via my analysis. Crosses indicate the posi- 
tions of sources for which the model was improved, plus 3C 147 
itself (source A). 



I therefore applied these corrections for /, g, and position to 
my sky models (independently for the 2003 and 2006 observa- 
tions), and repeated the calibration procedure. An improvement 
in single-band residuals was immediately apparent (Fig. [13]) - 
after Gp and M^^ solutions, the seven off-axis sources subtracted 
noticeably better. Residuals after AEsp solutions, on the other 
hand, looked pretty much the same (this is not surprising, since 
differential gains had already taken care of the visible off-axis 
errors in the original reduction), with a very slight improvement 
around 3C 147 itself, which can be explained by improved Gp 
solutions due to the more accurate sky model. 



2.5. Phase behaviour II 

Presumably, the remaining residual structures in Fig. [13] are 
more representative of the instrumental DDEs per se, since in- 
accuracies in the sky model have been significantly reduced. We 
should also expect the differential gain solutions to be more in- 
dicative of the actual DDEs (apart from the issue of resolved 
sources affecting ||A^|| on antennas RTC and RTD, which the 
improved sky models do not address at all). Of particular interest 
is the effect that the improved model has on the differential gain- 
phases. As for the gain-amplitudes, we would expect them to 
differ by only an overall per- source scaling factor. Indeed, mak- 
ing the same ||A^|| plot as in Fig. [7] confirms this - it is, to all 
intents and purposes, identical (and omitted here to save space), 
since the plotted amplitudes are renormalized by the per- source 
average ||A^||. 

The phases, on the other hand, show a marked difference, 
since the formerly dominant effect - that of position offsets - 
has been taken out. The arg A^ solutions themselves are shown 
in Fig. [141 Phase slopes are still very much in evidence, as can be 
seen in Fig. [15] Somewhat surprisingly, these slopes indicate that 
some residual position offsets remain, at a level of 15% to 20% 
of the original offsets (Fig. [16]). This suggests that my procedure 
of fitting phase slopes to arg A^ solutions, followed by fitting 
position offsets to the slopes, systematically underestimates the 
true position offsets. This is possibly an effect of the complex 
averaging implicit in having one A^ solution per a relatively 
large solution interval (20 MHz by 30 minutes). If so, this could 
perhaps be incorporated as a multiplicative correction factor in 
the model update procedure. Further work is required to fully 
understand the effect. 




Fig. 15. Phase slopes over the array as a function of time (in 
deg/km), using improved sky models for the 2003 (top) and 2006 
observations (bottom). The green lines indicate phase slopes cor- 
responding to the fitted position offsets (Fig. [16]). Compare to 
Fig.[IIl 



•b.03" 



Fig. 16. Fitted position offsets corresponding to the phase slopes 
of Fig. [15] (2003 observation on the left, 2006 on the right). 
The length of the arrows is exaggerated by a factor of 1200: the 
biggest offset is in fact just under 0.2''. Compare to Fig. [12] 



The brighter sources B, C, and (to a lesser extent) D show 
clear second-order phase effects, both in the phase slopes, and in 
the phase solutions themselves. The temporal continuity in the 
phase slopes can be interpreted as a time-variable position offset. 
I can speculatively offer two explanations for such an offset: 

- Unmodelled source structure (again!) For any given hour 
angle, an E-W array only sees an integrated cross-section 
through the source in a given direction. If the source is 
slightly resolved with an asymmetric "hotspot", the zero- 
order moment of each such cross-section will be slightly dif- 
ferent. 

- Differential tropospheric or large-scale ionospheric refrac- 
tion, including perhaps apparent change of baseline caused 
by refraction (the Anderson effect). 

Another puzzling feature of the arg A^ solutions in Fig. [14] 
are the significant and (to first degree) constant phase offsets of 
some sources (e.g. H, K, ae) on some antennas. The offsets are 
mostly (though not completely) consistent between the 2003 and 
2006 observations. None of the explanations offered above are 
consistent with a constant phase offset! Could this be the phase 
component of the primary beam? There are too few sources in 
this reduction to infer any sort of directional dependence, but 
perhaps the "QMC Project" can provide more insights on this 
effect. 

2.6. The lurking errors 

The two calibrations (with the original and the improved sky 
models) described above have produced what appear to be iden- 
tical final maps. This shows that the "flyswatter" can accommo- 
date for significant errors in the sky model. On the other hand. 
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Fig. 14. Differential gain-phases (arg AE, in de- 
grees) as a function of time, using improved sky 
models for the 2003 (top) and 2006 (bottom) 
observations. Compare to Fig. [8] 



the detailed structures in Fig. [15] suggest that (even in the very 
benign case of 21 cm WSRT observations!) moderately bright 
off-axis sources still require some form of DDE correction even 
if the model is perfect. If this is the case, then a legitimate ques- 
tion is: why worry about getting the sky model right, if we need 
to do solutions anyway, which will absorb any imperfec- 
tions? (Besides the obvious caveats of the "flyswatter" discussed 
in Sect. [r9l that is.) 



The rather striking image of Fig. [T7] shows that the final 
maps are not in fact identical, although the difference is buried in 
the noise. This image was produced by subtracting the original- 
model 8 -band residual map from the improved-model map (2003 
observation). Since the noise term in both maps is the same, sub- 
traction reveals very faint structures that would normally be hid- 
den in the noise. We're beginning to see more limitations of the 
"flyswatter". In the original reduction, apparent position offsets 
of the off-axis sources caused phase gradients in t, v-space in the 
differential gain-phases. These were approximated by a stepwise 
A^ solution (since I solved for only one A^ term per 30 min- 
utes, per entire band), which proved to be good enough to drive 
off-axis errors to a level below the thermal noise. Improving the 
model positions has effectively "flattened out" these gradients, 
reducing the error made by a stepwise approximation even fur- 
ther. Figure[T7]demonstrates the improvement. The radial spokes 
correspond to "jumps" at the boundaries of the solution intervals, 
but the other structures (especially the half-circles) are rather 
more difficult to explain, and will have to be addressed in follow- 
up work. 

The implications of this result is that any errors in the sky 
model (or uncorrected DDEs) will propagate into the selfcal so- 
lutions, and result in faint but highly coherent structures in the 
residual maps. We may think we are reaching the thermal noise, 
but in the process, we are producing "submerged" calibration 
artefacts at levels below the noise, where we can't even see that 
something is still going wrong ! This is of particular concern to 
ongoing work on detection of the Epoch of Reionization (EOR) 
signature, which relies on statistical analysis of residual images 
to find sub-noise art efacts of astrophysical origin (Marker et al. 
I2OIOI: iMorales et al.l ii2006). Such analysis will have to reckon 
with these lurking selfcal artefacts. 




Fig. 17. Calibration with an improved sky model. This shows 
the difference between the 8-band residual maps (2003 obser- 
vation) produced with the original and the improved sky mod- 
els. Structures around 3C 147 itself and the off-axis sources 
are mostly due to "selfcal contamination" in the original model 
caused by incorrect off-axis source positions. These are well 
within the noise: the intensity range of this image is ±2yuJy, while 
the 8-band maps have a thermal noise of 13.5yuJy. 



3. Conclusions 

One of the biggest selling points of the RIME formalism 
is the flexibility it offers for describing observational effects. 
Unfortunately, to date only three software packages have ex- 
ploited the power of the RIME (CASA, MeqTrees, and the 
LOFAR BBS system). Of these, only MeqTrees allows for truly 
arbitrary forms of the RIME. This paper has explored some prac- 
tical applications of one such form of the RIME: a form that in- 
cludes differential gain terms. I have demonstrated that the dif- 
ferential gain approach (the "flyswatter") can be a powerful way 
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of dealing with DDEs on a source-by-source basis. This has been 
used with WSRT data to produce artefact- free maps of 3C 147 at 
record dynamic ranges of well over a million-to-one. While the 
differential gain solutions themselves absorb inaccuracies in the 
sky model as well as the DDEs themselves, I have demonstrated 
that at least flux and positions corrections can be recovered, so 
iterative improvements to the sky model are possible. 

The latter may also prove be necessary: I have demonstrated 
that even a perfect-looking map produced using difl'erential gains 
contains a large number of selfcal artefacts hidden in the thermal 
noise, which can be significantly reduced by improving the sky 
models. These "invisible" artefacts have hitherto been ignored, 
but they should be of particular concern to projects relying on 
statistical signal extraction, such as the ongoing search for the 
EOR signature. 

The nature of the remaining DDEs (as seen in the diff'erential 
gain solutions) has not yet been adequately explained. Some of 
the amplitude eff'ects are consistent with pointing error. There 
phase behaviour is even more difficult to understand, but may be 
due to unmodelled source structure. Further work is required on 
the subject. 

I have shown that diff'erential gain-phase solutions can be 
used to detect position shifts to within small fractions of the 
synthesized beam size. Off'sets of less than 0.05'' (well under 
0.01 of the PSF size!) have been reliably detected. There is a 
very clear indication of a systematic rotational off'set of ~ 45'' in 
the sky model generated by NEWSTAR, when interpreted using 
MeqTrees. This is may be due to a coordinate conversion error 
somewhere in the visibility data processing tool chain, and needs 
to be investigated further. 

Finally, I should consider some wider implications of my 
results. All currently mooted schem es of DDE calibration for 
LQFA R (iNiiboer & Noordaml '2007). the MWA ( Mitchen etal] 
20081) and the i onosphere in general (Intemaetal. 200^; 
Cotton et a 120041) revolve around the use of "beacon sources" 
to probe the ionosphere and/or the primary beam. It is rather dif- 
ficult to envisage a closed-loop scheme without beacons (how 
else would one sample a DDE?), so future telescopes such as 
the SKA will most likely need to use something very similar. 
Any such scheme predicates on there being a suflftcient number 
of sufl&ciently bright in-beam beacons for any direction on the 
sky. This is not a problem at the LOFAR and MWA end of the 
spectrum, since the low-frequency sky is so much brighter, but it 
has been a bit of a worry for the higher frequencies, where FoVs 
are narrower and sources are fainter. 

My 3C 147 results suggest that calibration beacons can be 
a lot fainter than previously thought. What has been established 
is that for this particular configuration of the WSRT, sources as 
faint as 2 mJy can provide meaningful DDE solutions. This re- 
sult can be scaled to future telescope designs by comparing their 
expected sensitivity with that of the 3C 147 observation. 
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